Evaluating the Price of Consistency in Distributed File Storage Services
نویسندگان
چکیده
Distributed file storage services (DFSS) such as Dropbox, iCloud, SkyDrive, or Google Drive, offer a filesystem interface to a distributed data store. DFSS usually differ in the consistency level they provide for concurrent accesses: a client might access a cached version of a file, see the immediate results of all prior operations, or temporarily observe an inconsistent state. The selection of a consistency level has a strong impact on performance. It is the result of an inherent tradeoff between three properties: consistency, availability, and partition-tolerance. Isolating and identifying the exact impact on performance is a difficult task, because DFSS are complex designs with multiple components and dependencies. Furthermore, each system has a different range of features, its own design and implementation, and various optimizations that do not allow for a fair comparison. In this paper, we make a step towards a principled comparison of DFSS components, focusing on the evaluation of consistency mechanisms. We propose a novel modular DFSS testbed named FlexiFS, which implements a range of state-of-the-art techniques for the distribution, replication, routing, and indexing of data. Using FlexiFS, we survey six consistency levels: linearizability, sequential consistency, and eventual consistency, each operating with and without close-to-open semantics. Our evaluation shows that: (i) as expected, POSIX semantics (i.e., linearizability without close-to-open semantics) harm performance; and (ii) when close-to-open semantics is in use, linearizability delivers performance similar to sequential or eventual consistency.
منابع مشابه
Flexible, wide-area storage for distributed systems using semantic cues
There is a growing set of Internet-based services that are too big, or too important, to run at a single site. Examples include Web services for e-mail, video and image hosting, and social networking. Splitting such services over multiple sites can increase capacity, improve fault tolerance, and reduce network delays to clients. These services often need storage infrastructure to share data amo...
متن کاملSCFS: A Shared Cloud-backed File System
Despite of their rising popularity, current cloud storage services and cloud-backed storage systems still have some limitations related to reliability, durability assurances and inefficient file sharing. We present SCFS, a cloud-backed file system that addresses these issues and provides strong consistency and near-POSIX semantics on top of eventually-consistent cloud storage services. SCFS pro...
متن کاملDistributed Cloud Storage Services with FleCS Containers
There are limits to the ability to migrate or deploy applications across geographically distributed/loosely coupled cloud resources, requiring substantial data movement and/or uniformly visible and accessible storage services across such distributed infrastructure. To address these issues, we propose and explore the utility of FleCS – an approach for providing FLExible Cloud Storage services in...
متن کاملFlexible, Wide-Area Storage for Distributed Systems with WheelFS
WheelFS is a wide-area distributed storage system intended to help multi-site applications share data and gain fault tolerance. WheelFS takes the form of a distributed file system with a familiar POSIX interface. Its design allows applications to adjust the tradeoff between prompt visibility of updates from other sites and the ability for sites to operate independently despite failures and long...
متن کاملAnonymity and Trust in Large-Scale Distributed Storage Systems
Large-Scale Distributed Storage Systems (LS-DSSs) are at the core of several Cloud services. These externalized services may run atop multiple administrative domains. While a client may trust the organization that provides a given Web service, a single server may belong to another organization that the client does not trust. The design of a Distributed Storage System is itself a challenging tas...
متن کامل